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This paper analyzes a block-coding scheme designed to suppress 
spectral energy near f = for any binary message sequence. In this 
scheme, the polarity of each block is either maintained or reversed, de- 
pending on which decision drives the accumulated digit sum toward zero. 
The polarity of the block's last digit informs the decoder as to which deci- 
sion was made. 

Our objective is to derive the average power spectrum of the coded signal 
when the message is a random sequence of -\-l's and —Vs and the block 
length (M) is odd. The derivation uses a mixture of theoretical analysis 
and computer simulation. The theoretical analysis leads to a spectrum 
description in terms of a set of correlation coefficients, \p q \, q = 1, 2, etc., 
with the p g ' s functions of M. The computer simulation uses FFT algorithms 
to estimate the power spectrum and autocorrelation function of the block- 
coded signal. From these results, {p g } is estimated for various M. A 
mathematical approximation to p g in terms of q and M is then found which 
permits a closed-form evaluation of the power spectrum. Comparisons 
between the final formula and simulation results indicate an accuracy of 
±5 percent (±0.2 dB) or better. 

The block-coding scheme treated here is of particular interest because of 
its practical simplicity and relative efficiency. The methods used to 
analyze it can be applied to other block-coding schemes as well, some of 
which are discussed here for purposes of comparison. 

I. INTRODUCTION 
1.1 Block coding 

In its most general meaning, block coding consists of dividing a 
digital sequence into time-contiguous blocks and performing a sepa- 
rate coding operation on each block. In actual usage, the term is 

1103 



most often applied to cases in which both the original and block- 
coded sequences are binary and the coding serves either (i) to enable 
error detection and/ or correction or (m) to shape the digital sequence 
spectrum. 

The most widely used spectrum-shaping codes are those that sup- 
press the energy near zero frequency. This suppression enables the use 
of transformers and ac-coupled amplifiers in processing the digital sig- 
nal and, in modulation applications, provides for a null region near the 
carrier frequency to facilitate carrier extraction. Our concern here is 
with this kind of block coding. 

In particular, we examine a block-coding approach invented (in 
analog form) by F. K. Bowers. 1 The digital version of this scheme has 
been treated separately by Carter 2 and Pierce, 3 and implemented 
recently by Ruthroff and Bodtmann. 45 The scheme can be used with 
blocks of either odd or even length (M), but our attention here is con- 
fined to odd M. Our objective is to derive the average power spectrum 
of a sequence so coded when the original message sequence is totally 
random. This problem has been partially studied by Rice 6 for the same 
block coding with M even (to which case our method of analysis is 
equally applicable), and by Slepian 7 and Franklin and Pierce 8 for 
other dc-suppressing block codes. Results for some of these cases are 
given later. 

1.2 Description 

Unless otherwise specified, the term block coding means the process 
we describe here, with the aid of Figs. 1 and 2. 

The original sequence of binary digits is divided into blocks of length 
(M — 1), and a +1 digit (the so-called code digit) is added to the end 
of each block (Fig. la). A resettable counter measures the digit sum 
Ak in each block k, omitting the code digit if M is even and including 
that digit if M is odd. In either case, this count can take on only odd 
values. It is compared with the sum over all previous output digits, 
B k , and a decision is made as follows: li Ak and B k have the same 
polarity, all pulses in block A; are inverted; if Ak and Bk have opposite 
polarity, the pulses in block k are unaltered ; and if B k = 0, its polarity 
is taken to be that of its most recent nonzero value, i.e., Bt-i, or Bk-2, 
etc. * In the decoder, each received block is inverted if the polarity of 



* There are other ways to resolve the case Bk = (e.g., by random decisions, 
as suggested by Rice), and the ultimate choice should be dictated by practical 
considerations. 
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Fig. 1. — Sample input and output digital sequences. 

the recovered code digit is negative and is not inverted if that digit is 
positive. 

Figure 2 depicts the logical process just described and is a simplified 
diagram of how block coding is actually implemented. The identifica- 
tion of the code digit in the decoder is accomplished with the help of 
framing, which is not depicted (or treated) here. The penalties in this 
form of block coding are a 100/M-percent reduction in information 
rate and twofold increase in the random error rate. 
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Fig. 2 — Coding process. 
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1.3 Scope of the paper 

It is easy to see that the accumulated sum of the output digits at the 
end of any block is limited in magnitude to M (i.e., \Bk\ ^ M), and 
that the magnitude of this sum at any instant in time is limited by 
(3M - l)/2 if M is odd and 3ilf/2 if M is even. For this reason, the 
output sequence has no spectral energy — either discrete or con- 
tinuous — at / = 0. At the same time, the total sequence "power" is 
unchanged by the coding, since every digit has the same "energy," 
regardless of polarity.* Obviously, then, the suppressed energy near 
/ = is redistributed over the rest of the frequency range, and it is 
of more than passing interest to know how. 

The answer, of course, depends on the nature of the message se- 
quence being encoded. In this study, we assume a totally random se- 
quence (all digits independent, with equally likely polarities) and 
derive the block-coded signal spectrum for odd values of M. The 
derivation uses a mixture of theoretical analysis and computer simula- 
tion and leads to a closed-form expression for the spectrum which com- 
pares quite favorably with simulation data. Section II gives the purely 
theoretical part of the derivation, Section III describes the simulation 
study, and Section IV gives the final result and some examples. 

II. ANALYSIS 

2.1 General form of the spectrum 

We represent the uncoded message sequence as a binary stream of 
pulses at a rate 1/T, 

Si(t) = £ a„p(t-nT), (1) 

where 

a„ = + 1 or — 1 with equal probabilities, 

lifn = m (2) 

if n 9* m, v ' 



dndm = 



and p(t) is a pulse centered on [0, T~] of arbitrary shape, area T, and 
Fourier transform P(f). The first step in the coding consists of opening 
up a one-pulse slot after every M — 1 message pulses and injecting a 
positive pulse, -\-y{t). The new sequence, with positive code pulses 



* In the ensuing analysis, the digital sequence of Fig. 1 is replaced by a pulse stream 
at a rate 1/T, with each pulse having an area of magnitude T. 
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every M time slots, is then block coded to produce an output signal 

«.(*) = f b n p(t- nT). (3) 

It is the average power spectrum of s (t) that we wish to evaluate. 
We define the autocorrelation function of the coded signal to be 



R(t) = lim 



si /"' s (t)8o(t + r)(tt (4) 

iio J -To 

and the average power spectrum to be the Fourier transform of R(r), 

S(f) = 5{R(t)\ = f° B(t) exp (-jwr)dT. (5) 

To simplify the derivation of £(/), it is convenient to express s„(t), 
eq. (3), as the convolution 



s (t) =\ t, b n 8(t-nT) 



*v(t), 



(6) 



Su(f,T) 



where 5(t) is the unit impulse function. It is now obvious that S(f) is 
the product 

S(f) = SM;T)\P(f)\*, (7) 

where <S„(/; T) is the average power spectrum of s u (t; T) in (6), and 
P(f) is the Fourier transform of p(t). 

We can obtain £„(/; T) by applying (4) and (5) to s u (t; T). In so 
doing, we make use of the fact that the convolution between two unit 
impulse functions separated by mT seconds is a unit impulse function 
8(t — mT). It is then easy to show that 



SM; T) = »{ \ m t m [ Ihn ^ i_ N KK +m }(r - mT)\, 



(8) 



Ru(r;T) 



where b n b n+m is an average over the sequence ensemble and the 
bracketed term is the further averaging over the time position of b„. 
Because R u (t; T) is a sequence of uniformly spaced impulses, £„(/; T) 
is periodic in frequency with a repetition interval l/T. The shaping of 
this spectrum by the nonperiodic pulse spectrum function |P(/)| 2 
leads to the overall spectral characteristic of the block-coded signal. 
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2.2 Analysis of /?„(?-; T) 

This analysis is aimed at simplifying R u (t; T), eq. (8), by finding a 
description for b„b„ +m and its average over n. To do this, we make two 
important observations about the block-coded binary sequence \b n ) : 

(i) The 2 M possible digital sequences within each code block of 
\b n \ are equiprobable, even though the last digit is a code digit. 
{ii) The correlation between 6„ and 6„ +m depends only on the num- 
ber of blocks separating these two digits, i.e., on the number (q) 
of code digits in the interval [n, n + m). 

The first observation is easily proved : The first M — 1 digits of each 
block at the coder input, which are assumed to be totally random, form 
one of 2 M ~ l equiprobable sequences. With the addition of the +1 code 
digit, there are still only 2 M ~ l realizable sequences per block. The 
possible inversion of the block by the coder, however, produces another 
2 M ~ 1 realizable sequences (the original 2 M ~ l sequences with —1 instead 
of +1 for the last digit), leading to a total of 2 M . Further, since the 
probability of a block inversion is \ for a random input sequence, the 
2 M realizable output sequences are equiprobable. The significance of 
this is that \b„ \ is statistically the same as if all M digits in each input 
block were derived by random selection. 

The second observation depends on the first. For in the absence of 
block inversions and with all input digits randomly derived, there 
would be no correlation between any two digits of the digital stream. 
Any correlations in the block-coded sequence, therefore, are due solely 
to the inversions. It follows that b„b„+ m depends, at most, on the num- 
ber of possible block inversions (or code digits, q) between b n and b n+m . 

We conclude that R u (t; T) can be expressed in terms of a set of 
numbers {p q \, p q being the correlation between any two digits having 
q code digits between them. * To reduce (8) to such a representation, we 
first observe that, if \m\ = IM + p, where 1 S p ^ M, then q is 
either I or I + 1, depending on the position of b„ within the block con- 
taining it. By letting n vary from the first to the last block position, we 
can see that q = I for a fraction [_{M — p)/M~] of all possible positions, 
and q = I + 1 for a fraction (p/M) of all possible positions. We can 
therefore express the bracketed quantity in (8) as 



fe^X^-l^VJ^+ltf)^ 



(9) 



* It is obvious that po = 0, because any two digits in the same input block are 
taken to be uncorrelated, and this fact is not altered by the coding. 
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Fig. 3 — Representation of R(m; M). 
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(10) 



Since p = \m\ — IM, the right side of (9) can be re-expressed as 

n , ir , (I + \)M - \m\ \m\ - IM 
R(m ; M) ^ K ^ ' n , L_! P| + i-A__ Pi+v 



M 



M 



(ID 



with I related to m and M by (10). 

It is clear from the statistical symmetry of the b's in (9) that 
R{ — m;M) = R{m;M), and (11) reflects this fact. It is also clear 
from (9) that R(0; M) = 1. Wo can thus express R u (t; T) in (8) as 

R„(r; T) 

= y[*(r) + t i R(»i;M){5(r + mT) +«( r -mT))J. (12) 

Although the mathematical description for S(w ; M), eq. (11), seems 
complicated, it has the very simple graphical interpretation shown in 
Fig. 3. The value of R{m; M) for q complete block separations (i.e., 
m = ±qM) is just p q ,' and the variation between >n = qM and m 
= (q + Y)M is a linear progression from p q to p,+i. This result can 
now be used to derive S u (f; T). 



' The one exception to this is the singular case q = 0, where R (m = ; M) = pa + 1, 
with po = 0. 
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2.3 Expression for S u (f; T) 

If the lines in Fig. 3 are envisioned as impulses of area 1/T separated 
by T seconds, then &„(/; T) can be found as the Fourier transform of 
this sequence. The algebra is straightforward but somewhat tedious, 
and so we give just the final result: 

8.U; T) = i[l + 2m(|^/2 Y £ ,.„».*!■]. (13) 

The periodicity of this function, with repetition interval 1/T, is easy 
to see. The p g 's are functions of M so that a complete description of 
&"(/; T), and thus of S(f) as given by (7), reduces to knowing the 
array of functions \p q {M)\. Unfortunately, there is no apparent way 
to determine these functions from purely theoretical considerations. 
One useful bit of information, however, is that S u (0; T) = by virtue 
of the block coding.' This being the case, we see from (13) that 

I, *<*> - " m < 14 > 

Beyond (7), (13), and (14), we have little information about the 
block-coded signal spectrum on theoretical grounds. Using the methods 
of computer simulation, however, it is possible to estimate the p q 's 
for various M, and to seek functional descriptions for them that permit 
a closed-form evaluation of (13). This task constitutes the remainder 
of the development. 

III. SIMULATION STUDY 
3.1 Computer programs 

The computer programs used to derive {p g (M)\ empirically are 
depicted in Fig. 4. The routine called BLOCK generates random se- 
quences \a„\ having the properties described by (2) and, for specified 
M, converts them into block-coded sequences {&,,} by emulating the 
logic in Fig. 2. These coded sequences are supplied on demand to the 
main program, labelled SIMULATION. 

The SIMULATION program operates in the following manner: In 
each of Nt trials, it accepts an JV-term sequence from BLOCK and 
performs an iV-point discrete Fourier transform (DFT 9 ), producing 
complex spectral samples at / = k/NT, k = 0, N — 1. The squared 
magnitude of the &th sample (normalized by N) represents a one-trial 



* This is so because the long-time integration of the coded sequence is bounded in 
magnitude (specifically, by 3M/2). 
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Fig. 4 — Computer simulation program. 



estimate of TS u (f = k/NT; T). These estimates for the N T trials are 
averaged to produce the array {£*}, k = 0, N — 1. This procedure is 
made efficient by implementing the DFT's with fast Fourier transform 
(FFT) algorithms. 10 To maximize efficiency, N is constrained in all 
simulations to be an integral power of 2. 

The full benefit of the multitrial averaging is obtained by enforcing 
statistical independence among the iVr sequences supplied by BLOCK, 
and also by effectively randomizing the time phase of the sequences 
analyzed. The latter is accomplished by means of the function labelled 
"SELECT SUBSEQUENCE . . ." (Fig. 4), which causes the starting 
time of the analyzed sequences to vary uniformly among the M possible 
positions within a block. To accommodate this feature, the independent 
sequences supplied by BLOCK have a total length iV + N r — 1 or 
greater. 

There are three output arrays produced in the SIMULATION pro- 
gram. One is \S k \, which approximates TS u (f; T) at the N frequencies 
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k/NT, k = 0, N - 1. This array is applied to the PLOT routine, Fig. 
4, where it is plotted on the same graph as the mathematical function 
given by (28). This function is derived using numerical parameters 
extracted from \R m }, the inverse DFT of {$*}, which is the second 
output array, Fig. 4. The elements of this array represent estimates of 
R(m ; M), i.e., R m = R(m ; M), and they are applied to the COMPUTE 
routine to produce estimates of p q , i.e., the array {p q \. These are the 
quantities used to derive (28) from (13). To evaluate the accuracy of 
these estimates, the COMPUTE routine also produces the array {& q ), 
where & q is the approximate rms error in p q . The computation of {& g } 
involves the array { R™ j , the inverse DFT of { Si } , which is the third 
output of the SIMULATION program. The formulas relating {R m } ; 
{pq}, {R™ }, and \& q ) are presented in Section 3.3. 

3.2 Choices of N and N T 

The difference between the computer-derived spectral sample Sk 
and the quantity it approximates, TS u (k/NT; T), contains two dis- 
tinct components, (i) a deterministic error due to the finiteness (N) of 
the sample length and (ii) a random error due to the finiteness (Nt) 
of the number of independent simulations. Similar remarks apply to 
the difference between R m and R (in ; M ). We now apply these considera- 
tions to the choices of iV and Nt. 

Because iV is finite, the normalized spectrum estimated by the com- 
puter program is not TS u (f; T), but the convolution between 
TS u (f; T) and the function 

Thus, S k is an estimate of the quantity 

TS;( jfj,; T^ = £ TSuU; T)F^f - ^ )d/. (16) 

Since the area of F(f) is unity, the approximation of TS u (k/NT; T) 
by TSu(k/NT; T) is very good if TSu(f, T) changes negligibly over 
the main lobe of F(f — k/NT). The difference is the deterministic 
error in Sk] the inverse DFT of the fc-sequence of these errors gives 
the deterministic errors in the estimates of R(m ; M). 

By considering the interference between the peak of TS u (f;T) 
(which occurs near / = 1/4MT) and the sidelobes of F(f) (which 
decrease as l// 2 ), we have determined a rule of thumb for which worst- 
case deterministic errors are negligibly small. The rule constrains iV 
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to the region 

N ^ 100 M, (17) 

a constraint we have used throughout this study. 

The random errors in estimating R{m;M) also decrease with in- 
creasing N, as we shall see in Section 3.3. This consideration, added to 
(17) and the requirement that N be an integral power of 2, has helped to 
shape the final choices of N for different values of M. Typically, we 
have used N = 512 for M = 3, N = 2048 for M = 9, and N = 4096 
for M = 17. 

We shall also see in Section 3.3 that the random errors in estimating 
S„(/; T) and R(m;M) decrease with increasing N T . For example, the 
fractional rms error in S k is accurately given by 1/ViVr. In deriving 
estimates for R(m;M), we have used 400 trials to achieve the ac- 
curacies desired, while, to obtain precise spectral estimates for com- 
parison with the final formula, we have used 1600 trials (corresponding 
to ±2.5 percent accuracy). 

3.3 Analysis of computer results 

The simulation estimates of R(m ; M) for m > and M = 3, 9, and 
17 are given by the points in Figs. 5, 6, and 7. The existence of straight- 
line variations between m = qM and m = (q + 1)M for q = 0, 1, 2, 
etc., as predicted by the analysis of Section II, is evidenced here. The 
deviations of the points from straight lines are due to statistical fluc- 
tuations in the finite simulation, and the straight lines shown are 
derived from the points by least-squares techniques. The pertinent 
error analyses and estimation procedures used to obtain these straight 
lines and further data reductions are now summarized. We assume 
from here on that deterministic errors are made negligible by the 
choice of N, i.e., that all errors in S k and R m are random errors due to 
finite N T . Table I lists the symbols to be used. 

3.3.1 Error correlations 

We now establish the error correlations 8 k 8 t and e m e p with the aid of 
the definitions in Table I. As N becomes very large, the real and 
imaginary parts of u k become more and more like independent gaussian 
variates (central limit theorem 11 ), and we assume this to be the case 
here. The importance of this assumption is that the definitions of S k , 
S k , and S~ k in Table I can then be used to obtain 



8k8 < = wr (18) 
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Fig. 5— Simulation results for M = 3 (N = 512, N T = 400 trials). 
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Fig. 6— Simulation results for M = 9 (JV = 2048, N T = 400 trials). 
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Fig. 7— Simulation results for M = 17 (N = 4096, N T = 400 trials). 



Clearly, 5* = (Sic) 2 /Nt, i.e., the rms error in estimating S k is o*/"\lVj». 

Unfortunately, 5*5* for I ^ k is not zero for the block-coded signal 
under study. Nevertheless, these correlations are found to be suffi- 
ciently weak (particularly as \k — l\ increases) that we can ignore 
them without any first-order effects on the results. The benefit of this 
is a considerable simplification in the mathematics. Accordingly, we 
shall assume that 

(S~ k ) 2 /N T ;l = k 
[0; l^k. 



8 k 8i = 



(19) 



Table I — Symbols used in error analysis 



Symbol 



Meaning 



{«*}, k = 0, N - 1 

S u (f; T) 

S k 

s; 

R(m;M) 
R n 

<m 

Pl> Pi 



DFTof \b n \, n = 0, N - 1 

Average power spectrum of infinitely long block-coded signal 
Simulation estimate of S u (f = kT/N ;T):S k = Ave(\u k \*/N) 



Limiting value of <S* as Nt — * °° : S k = 

Random error in S k : S k = S k — S k 

Coefficient, at separation m, of autocorrelation function of 

block coded signal 
Simulation estimate of R(m; M): \R m \ = IDFT|S*) 
Random error in R m : e m = R m - R(m; M) = IDFT|5*| 
Exact, estimated values of R(m = qM ; M) 
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To establish e m e p , we begin with the fact that { c m } is the inverse 
DFT of {5k} . Using (19) and a modest amount of manipulation, we can 
then show that 

a = t4t- [idft{ (^) 2 } | m+p + idft{ (s k y} \ ]m . P a (20) 



AW, 



/? (2 L Rf 2> ! 



In the SIMULATION program (see Fig. 4), the array {/2J 2) } is esti- 
mated by computing the inverse DFT of {Si}, and is then used to 
determine the rms errors in the estimates for p q (M). 

3.3.2 Estimations of Pq (M) 

Given the simulation estimates {R m }, an obvious way to estimate 
p q ( = R{m = qM; M)) is by p q = R m = g M. However, the values of R m 
for m = qM — 1, qM — 2, • • -qM — (M — 1) can be included to 
yield more accurate estimates of p q , as the following analysis shows. 

Using Table I and Fig. 3, we can express R m in the general form 

R m = p 9 -i + P " ~ M P9 ~ 1 [m - (q - DM] + e m ; (q - 1)M < m ^ qM 

q = 1, 2, etc. (21) 

Now suppose that we estimate p q as a linearly weighted sum of R m 
over [_{q — \)M -f- 1, qM~\. Using the substitution m' = m + (q — l)M, 
this sum can be written as 

M M 

Pq = E IVmRm' = Pq-1 E W m 
m=\ m =1 

. _ . M M 

+ J 9 " 1 Z wttf. + E i^ m e m «. (22) 

J« m =i m =i 

This estimate is made unbiased by choosing {iv m } so that 



M 

E Wm = 

m = l 



Arbitrary <7 = 1 and g ^ = M (23) 

1, q > 1 m=l 



(For the singular case 9=1, there is no constraint on £w m because p 
is known to be zero.) To see how to choose {iv m } (m = 1, M) within 
these constraints, we combine (20), (22), and (23) and obtain the 
following mean-square error for p q : 

MM O M M 

c\= E E w m w p 7&v = ^ E T, w m wJtR2?+, + Rf&sQ. (24) 
To a first approximation, the dominant component of o\ is 2R$ ) (Y,w m )/ 
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Table II — 


Estimates of p q (M) 




\M 


3 


5 


9 


13 


17 


q\ 


(<t=3.1X10- 3 ) 


(<r = 1.8X10-') 


(a = i.oxio- 3 ) 


(o-= 6.1X10-*) 


(<r=5.3XlO- 4 ) 


1 


-0.06321 


-0.04787 


-0.03098 


-0.02271 


-0.00172 


2 


-0.14363 


-0.06625 


-0.02870 


-0.01751 


-0.00128 


3 


-0.01754 


-0.00632 


-0.00168 


-0.00140 


-0.00081 


4 


0.07654 


0.02712 


0.00659 


0.00398 


0.00212 


5 


0.01431 


0.00515 


-0.00018 


0.00024 


0.00066 


6 


-0.04212 


-0.01120 


-0.00198 


-0.00077 


-0.00091 


7 


-0.01074 


-0.00200 


-0.00075 


0.00075 


-0.00041 


8 


0.02653 


0.00701 


0.0 


0.00002 


0.00103 


9 


0.00546 


-0.00018 


0.00069 


0.00039 


0.00088 


10 


-0.01244 


-0.00483 


-0.00026 


0.00069 


0.00020 


11 


-0.00600 


-0.00040 


-0.00016 


-0.00009 


-0.00050 


12 


0.01124 


0.00185 


-0.00103 


0.00017 


0.00061 


13 


0.00275 


0.00143 


0.00183 


0.00024 


-0.00031 


14 


-0.00471 


-0.00096 


0.00009 


0.00032 


-0.00017 


15 


-0.00062 


0.00022 


-0.00008 


0.00003 


0.00018 


16 


0.00038 


0.00200 


-0.00062 


0.00061 


0.00027 


17 


0.00171 


0.00016 


0.00013 


-0.00023 


0.00017 


18 


0.00027 


-0.00407 


-0.00087 


-0.00050 


0.00004 


19 


0.00254 


-0.00134 


0.00039 


0.00047 


-0.00019 


20 


0.00024 


0.00201 


0.00057 


0.00051 


0.00024 



NNt, because | Rn ] I tends to be small for n ?± 0. Using this fact, an 
approximate least-squares approach is to derive the sequence { w m } for 
which Yl w m is a minimum within the constraints of (23). Using Lagran- 
gian multipliers, it is straightforward to show that the solution is 



w m = 

(m=l,M) 



2 

- T7r + 



Qui 



M ' M(M + 1) 
Qm 



XM + l)(2M + 1) ' 



; q> 1 

9= 1. 



(25) 



We assume that, for practical purposes, (25) represents the least- 
squares coefficient array for the error given by (24). It was used in the 
COMPUTE routine of Fig. 4 to obtain [/$,} [based on (22) and the 
estimates {#»,)] and to estimate j a q \ [based on (24) and the estimates 
of |B»}]. 

The results are shown in Table II for several values of M and for 
q = 1, 20. It is found that cr q is fairly constant with q, except for o\, 
which tends to be lower by 10 to 30 percent. The quantity a in each 
column heading of Table II is the average of the computed <7 3 's from 
q = 2 to q — 20. * These rms errors are lower than those obtained 



* Note that, for all M, p q is in the simulation "noise" (i.e., \p q \ ^ a) for q ^ 20. 
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by estimating p q as R m =gM [which is equivalent to using \w m \ 
= (0, 0, • • • , 1)], the improvement factor increasing with M and 
having a value near 2.5 for M = 17. 

3.4 Reduction to mathematical descriptions 

From the data of Table II, a useful and valid description for p g (M) 
can be shown to be 

fcdO = jf + % , W 

where A q and B q are functions solely of q. For each q, raw estimates of 
A q and B q are derivable from the p q values at any two values of M. To 
satisfy (14) for all M, however, it is necessary that these estimates be 
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Fig. 8 — Possible solutions for {.4,1 and {B,\. 
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refined to satisfy 






?>.- "21' 2 B g = 0. (27) 



Moreover, these refined estimates should express A 9 and S g in mathe- 
matical forms that will permit a closed-form evaluation of (13). One 
possible solution that satisfies all these requirements is Fig. 8. Using 
these results in (26) and comparing with the values in Table II, agree- 
ment is found to be quite good : Nearly all the new estimates of p, 
obtained in this way lie within a standard deviation (±o-) of the tab- 
ulated values; also, the values of p q for q > 20 lie within ±<r about 
zero, decaying in magnitude with q as expected from physical reasoning. 

IV. FINAL RESULT AND EXAMPLES 

4.1 Expression tor S u (t; T) 

Combining Fig. 8 with (26) and (13), it is possible to obtain a 
closed-form expression for S u (f; T). Once again, the algebra is tedious 
but straightforward, so we merely state the result : 

where 

F(uT; M) = 0.645 cos (McoT) - ^ cos (2M w r) 

n oc cos (2MwT) - 0.2 1 

— U.85 



cos (2Mo)T) - 2.6 13 + 12 cos (2Ma>T) 
X f -| [-7.2 + 6.4 cos (MuT) - 10.8 cos (2MuT) 
+ 0.6 cos (3Af»!T)] + 0.05[12 cos (4MuT) + 2 cos (5MwT) 

+ 18 cos (6MaT) + 3 cos (7JWo)7 1 )]l ; M odd. (28b) 

Rather than do an error analysis of this result (e.g., based on the a's 
in Table II), we have compared this formula Avith fresh simulation 
results for {S k } based on 1600 trials (±2.5 percent rms error). The 
PLOT routine shown in Fig. 4 plots the simulation data as points and 
plots the formula as a solid line. Figures 9 through 13 give the results 
for M = 3, 5, 9, 13, and 17. Given the rms errors of the simulations 
and the scatter about the solid curves, we estimate from these com- 
parisons that the formula is accurate to within ±5 percent (±0.2 dB) 
or better for all M and w. The accuracy is especially good in the all- 
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important rising portions near / = 0, where the simulation points are 
seen to lie very close to — or within the line thickness of — the solid 

curves. 

4.2 Comparison with even-M block code 

It is tempting to extrapolate the new formula to the case of even M, 
although Fig. 2 warns us that the coding schemes for odd and even M 
are qualitatively different. Figure 14 shows simulation points, along 
with a solid curve derived from the new formula, for M = 4. Figures 
15 and 16 do the same for M = 8 and 16. Although N and N T are 
lower in these simulations, the consequent increases in the deterministic 
and random errors do not account for the observed discrepancies. It 
is concluded that the new formula is not accurate for low even values of 
M, but that its accuracy improves as M increases to large even values. 



i.- 1.6 




'•'•'■'•'•• SIMULATION RESULTS 
FORMULA 



0.2 0.3 

fT 



0.4 



0.5 



Fig. 9 — Comparison of formula with simulation results for M = 3 (Nt = 1600). 
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Fig. 10 — Comparison of formula with simulation results for M = 5 (Nt = 1600). 
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Fig. 11 — Comparison of formula with simulation results for M = 9 (N T = 1600). 
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Fig. 12 — Comparison of formula with simulation results for M = 13 (Nt = 1600). 

4.3 Comparison with zero-disparity code 

A different kind of block-coding scheme is one in which M is even 
and each block is constrained to have an equal number of positive and 
negative digits. 2 For a given M, this so-called zero-disparity code is 
less efficient in information rate than the one studied here (see Franklin 
and Pierce 8 ), but has superior spectral properties, as we now show. 

For the zero-disparity code, Franklin and Pierce show that TS U (/; T) 
is M/(M - 1) times the function (28a), with F(coT;M) replaced by 1. 
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Fig. 13 — Comparison of formula with simulation results for M = 17 (Nt = 1600). 
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Fig. 14 — Comparison of formula with simulation results for M = 4 (Nt = 400). 
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Fig. 15 — Comparison of formula with simulation results for M = 8 (Nt = 400). 
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| M= 16 
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Fig. 16 — Comparison of formula with simulation results for M = 16 (Nt = 400). 

For/ ^ 1/2T and M ^ 4, this result can be represented to within 0.3 
dB as follows : 



M - 



-Wi^i-p^)". m 



M 
This function is shown (dashed curve) in Fig. 17 and compared with 




Fig. 17 — Comparisons with zero-disparity block code. 
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Fig. 18 — Block-coded signal .spectrum for various pulse shapes (M = 9). 

some corresponding results for the block code studied here. The latter 
results, based on the new formula, are for M = 5, 9, and 17. The 
suppressed energy near / = is seen to be redistributed to higher fre- 
quencies (and in a more uniform way) by the zero-disparity code, 
permitting more relaxed requirements on ac-coupled processing stages 
for a given M . If the quantity held fixed is information rate, however, 
the zero-disparity code must use a larger value of M for which its 
spectral superiority all but vanishes. 

4.4 Effects of pulse shape 

The overall spectrum of the block-coded signal must take into ac- 
count the spectrum of the pulse shape p(t). Figure 18 gives some results 
for <S(/), (7), for the case M = 9. The relative differences due to pulse 
shape are identical to those that occur without block coding. The effect 
of the block coding is to force spectral nulls near / = n/T, (n = 0, 1, 2, 
etc.) and "bumps" within ±1/4MT of each null. 

V. CONCLUSION 

The general analysis of Section II leading to (7) and (13) applies to 
a wide class of block-coding schemes aside from the one treated here. 
The simulation/analysis procedures described in Section III can like- 
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wise be applied to these schemes to find the p g 's. Unfortunately, the 
computer costs involved in accurately estimating the p 5 's and com- 
paring the resulting formula with simulation data can be quite high. 

Aside from computer cost considerations, a strictly theoretical solu- 
tion to this kind of problem would be more accurate and provide more 
insight into the correlation factors influencing this kind of random 
process. Although qualitative explanations can be given for the os- 
cillating behavior of p q with q (Table II), the approach described here 
requires and offers little insight into such phenomena. 

In strictly practical terms, however, the result of this study provides 
a spectrum description which is quite accurate and fairly simple to 
use. For studies involving the passage of block-coded signals through 
ac-coupled amplifiers, or carrier extraction from signals modulated 
with block-coded sequences, such descriptions are highly useful. 
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